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Abstract 



The intra-host evolutionary and population 
dynamics of the human immunodeficiency 
virus type 1 (HIV-1), the cause of the acquired 
immunodeficiency syndrome, have been the 
focus of one of the most extensive study efforts 
in the field of molecular evolution over the past 
three decades. As HIV-1 is among the fastest 
mutating organisms known, viral sequence 
data sampled over time from infected patients 
can provide, through phylogenetic analysis, 
significant insights about the tempo and mode 
of evolutionary processes shaped by complex 
interaction with the host milieu. Five main 
aspects are discussed: the patterns of HIV-1 
intra-host diversity and divergence over time 
in relation to different phases of disease pro- 
gression; the impact of selection on the tempo- 
ral structure of HIV-1 intra-host genealogies 
inferred from longitudinally sampled viral 
sequences; HIV-1 intra-host sub-population 
structure; the potential relationship between 
viral evolutionary rate and disease progression 
and the central evolutionary role played by 
recombination occurring in super-infected 
cells. 



Introduction 

The human immunodeficiency virus type 1 
(HIV-1), a small diploid RNA retrovirus with a 
genome size of approximately 10,000 
nucleotide bases, was isolated for the first time 
about three decades ago and recognized as the 
etiologic agent of the acquired immunodefi- 
ciency syndrome (AIDS). 1 Early study soon 
demonstrated that HIV-1 displayed remarkable 
inter- and intra-host genetic heterogeneity, 2 4 
as well as specific evolutionary patterns within 
the infected host. 5 Three main factors con- 
tribute to the fast evolution of the virus. First, 
the low-fidelity of the reverse transcriptase 
characterized by an error rate of 3.4xl0~ 5 
mutations per nucleotide per cycleresulting in 
the introduction of one nucleotide substitution 
every second to third newly synthesized 



genome. 67 Second, the rapid viral generation 
time - defined as the time from release of a 
virion until it infects another cell and causes 
the release of a new generation of viral parti- 
cles - estimated in 1.5-2.6 days with a produc- 
tion of ~10 10 -10 12 new virions each day. 811 
Third, the elevated recombination rate with a 
number of crossovers between two RNA tem- 
plates estimated to range from three to nine 
per genome per round of replication. 12 14 These 
factors make HIV-1 one of the fastest evolving 
organisms known and the heterogeneous viral 
swarm that infects most HIV-lseropositive 
individuals fits the definition of a measurably 
evolving population, i.e. a population from 
which molecular sequences taken at different 
points in time will display a statistically signif- 
icant number of genetic differences. 15 In other 
words, HIV-1 evolution in vivo is an observable 
microevolutionary process that can be investi- 
gated through the analysis of viral sequences 
sampled longitudinally (heterochronous 
sequences) from an infected host. 

HIV-1 intra-host evolutionary and popula- 
tion dynamics have been under extensive 
investigation, 1617 especially for their potential 
role in pathogenic processes, 18 and because 
they constitute a major challenge for the devel- 
opment of an effective vaccine. 1921 
Nevertheless, two fundamental questions - 
what are the intra-host ecological factors shap- 
ing viral diversity over time and what is the 
potential relationship between viral evolution 
and pathogenesis - are still presently debat- 
ed. 16 

In the next sections, the current knowledge 
on HIV-1 intra-host evolutionary patterns in the 
absence of antiretroviral treatment (ART) will 
briefly be reviewed with special emphasis on 
results obtained through the development and 
application of phylogenetic techniques. I will 
show how phylogenetic analysis has been able 
to shed light on the complex interplay between 
viral evolution and immune selection that can 
ultimately result in disease progression. 

Patterns of HIV-1 intra-host 
diversity and divergence over 
time 

HIV-1 genetic variability at the nucleotide 
level can reach up to 5% within an infected 
subject and provides the viral population with 
the ability to adapt rapidly to changes in its 
environment. 16 The infection is typically char- 
acterized by three phases - acute infection, 
clinical latency and progression to AIDS - dur- 
ing which viral genetic divergence and diversi- 
ty display consistent patterns that have been 
associated with disease progression. 22 23 HIV-1 
intra-host diversity and divergence over time 



can be defined according to specific phyloge- 
netic criteria. 24 Given a data set of HIV-1 
sequences sampled longitudinally from an 
infected patient, diversity is the average pair- 
wise genetic (nucleotide or amino acid) dis- 
tance within the sequences sampled at a given 
time point, while divergence is their average 
genetic distance from the most recent common 
ancestor (MRCA), i.e. the root of the viral 
genealogy (phylogeny). 

When an individual becomes infected with 
HIV-1, a relatively homogenous population of 
the virus is initially harbored because trans- 
mission is usually associated with a signifi- 
cant population bottleneck. 25 27 A new infection 
can be established by just one single virion, 28 29 
although the number of variants transmitted 
seem to be dependent on the transmission 
route and bottlenecks associated with sexual 
transmission may be greater than with vertical 
transmission (e.g. mother to child or injecting 
drug use). 30 It has been shown that in men 
who have sex with men (MSM) the minimum 
number of transmitted viruses can range from 
2 to 10 with subsequent viral recombination 
leading to rapid and extensive genetic shuf- 
fling among viral lineages. 31 A combined analy- 
sis also revealed a significantly higher fre- 
quency of multivariant transmission in MSM 
than in heterosexuals. 31 The severity of trans- 
mission bottlenecks could be the result of 
anatomical barriers (e.g. mucosal barrier in 
heterosexual transmissions) and by immune 
selection mediated by early cytotoxic T cell 
responses. 32 However, in an animal model, 
infection with a highly heterogeneous SIV viral 
swarm of CD8 + depleted Rhesus macaques, 
where immune selection is expected to be 
weak or inexistent, low frequency SIV variants 
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characterized by specific amino acid motif in 
gpl20 were still transmitted more efficiently 
than higher frequency ones in six different 
primates. 33 Overall, it is important to keep in 
mind that these studies used, by necessity, 
indirect methods to quantify the amount of 
transmitted variants, because it is obviously 
very difficult to measure HIV-1 genetic diversi- 
ty at the exact moment of transmission. 
Indeed, it has rightly been observed that homo- 
geneity of the viral population during primary 
infection does not necessarily imply homo- 
geneity of the transmitted variants. 34 

The clinical latency phase of HIV-1 infection 
is characterized by a progressive increase in 
both viral divergence and diversity, while dur- 
ing the last phase, when the immune system 
collapses and progression to AIDS begins, viral 
divergence stabilizes and viral diversity 
declines. 23 This observation has been 
explained as a consequence of CD4+ T cell 
depletion, which likely results in less effective 
selection pressure on the virus, as well as sig- 
nificant decrease in target cells capable of sus- 
taining viral replication. 35 



Temporal structure of HIV-1 
intra-host genealogies 

Phylogenetic studies have clearly shown 
that, in the absence of ART, the central para- 
digm governing HIV-1 intra-host evolutionary 
dynamics following primary infection is the 
turnover of genetic variants over time through 
sequential population bottlenecks. 523 HIV-1 
intra-host genealogies of heterochronous 
sequences display strong temporal structure, 
where sequences from the same sampling 
time tend to cluster together and are the direct 
ancestors of sequences from the following 
time point. A typical example, using a data set 
of HIV-1 env gpl20 V1V3 sequences from a 
pediatric patient infected via mother to child 
transmission, 36 is given by the maximum like- 
lihood (ML) tree in Figure 1A. Increasing 
branch lengths from root to the latest sampled 
tips also reveal the increase in both viral diver- 
sity and divergence over time. The key point is 
that topology and branch length patterns of 
HIV-1 intra-host genealogies are the result of 
specific phylodynamic processes, defined as 
the interaction between evolutionary {i.e. 
mutation, genetic drift, selection) and ecologi- 
cal (population dynamics and environmental 
stochasticity) factors. 37 In particular, the strik- 
ing similarity between HIV-1 intra-host and 
influenza A inter-host genealogies, both char- 
acterized by strong temporal structure, seem to 
indicate that HIV-1 population bottlenecks may 
be driven by continual immune selection. 37 
Studies based on pairwise sequence compar- 



isons of synonymous (dN) and nonsynony- 
mous (dS) substitutions have suggested that 
neutral genetic drift is the dominant force 
shaping the intra-host evolution of the 
virus. 38,39 However, such methods have the ten- 
dency to underestimate the strength of selec- 
tion. 40 Analyses of dN/dS ratios using an ML 
framework that takes into account the phyloge- 
netic tree relating the sequences under study, 
have shown that both purifying and positive 
selection play a substantial role in the contin- 
uous emergence in vivo of new variants and 
the ability of the virus to evade immune 
response. 364142 HIV intra-host adaptation rate 
in the env gene has, indeed, been estimated to 
the astounding value of one adaptive fixation 
event every ~2.5 months. 43 Moreover, the com- 
parison of HIV-1 replication rate of virus 
genomes isolated at early times following 



infection with that of later viruses has demon- 
strated an increase of fitness during chronic 
infection, 44 and escapes from both CD8+ T-cell 
responses and neutralizing antibodies - major 
driving forces of rapid lineage turnover - are 
well documented. 45 " 17 

The degree of temporal structure exhibited 
by HIV-1 genealogies of longitudinally sampled 
sequences can be affected by additional factors 
that can alter the ideal ladder-like topology. 
The ML tree in Figure IB is an example 
obtained by analyzing gag p24 viral sequences 
from a subject carrying the HLA class I allele 
B*5701, 48 which is associated with slower dis- 
ease progression. 49 By applying a nonparamet- 
ric test for population structure, it has been 
shown that a complete within host viral popu- 
lation turnover may take up to 22 months. 50 
Therefore, the intermix of viral sequences at 70 




Figure 1. Maximum likelihood genealogies of HIV-1 intra-host sequences sampled over 
time from two different subjects. The trees were obtained with the HKY+G model. Branch 
lengths are scaled in nucleotide substitutions per site according to the bar at the bottom 
of each tree. The number after the @ sign in the sequence label indicates the sampling 
time in days post infection. Sequence names from different time points are displayed in 
different colors for clarity. The colored circles highlights the internal nodes representing 
the most recent common ancestor of the viral population sampled at a given time point. 
The root was inferred by choosing the rooted topology that resulted in the best linear 
regression between branch lengths and sampling times. The tree for subject S4 (left) 
includes a subset of env gpl20 VI V3 sequences described in Salemi et al. 36 (2007) from a 
pediatric patient infected through mother to child transmission. The tree for subject P5 
(right) includes a subset of gag p24 sequences described in Norstrom etaL 4 * (2012) from 
a patient carrying the HLA-B*5701 allele with low risk of disease progression. Sequences 
sampled at a later time point clustering with sequences sampled earlier are indicated by 
arrows and may reveal archival genomes from viral reservoirs. The alignments used in the 
analysis are available from the author upon request. 
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and 106 days post infection shown in Figure IB 
is expected and typical of genealogies inferred 
from longitudinal samples with relatively short 
time intervals. Other deviations, however, can 
provide critical insights on factors driving HIV- 
1 intra-host population dynamics. For example, 
the clustering of sequences sampled at more 
distant time points, like the one indicated by 
the arrows in Figure IB, within a monophylet- 
ic clade of earlier sequences may indicate the 
activation (or re-activation) of archival 
genomes, possibly from a viral reservoir. 
Peripheral blood mononuclear cells have been 
noted to harbor small latent reservoirs of infec- 
tious virus, 5152 which seem to provide one of 
the mechanisms for lifelong persistence of 
HIV-1 under ART. 53 Although phylogeny-based 
criteria have been proposed to detect reser- 
voirs where viral replication is restricted, 54 a 
full characterization of HIV- 1 reservoirs is still 
lacking. From a phylogenetic perspective, a 
limiting factor is that topological differences 
among genealogies can intuitively be assessed 
by visual inspection (Figure lAand B), but are 
actually difficult to quantify. A new statistic, 
Temporal Clustering (TC), has recently been 
developed to provide a quantitative measure of 
the degree of topological temporal structure in 
a serially sampled genealogy. 55 56 TC values rep- 
resent the expected deviation of an observed 
genealogy from the null hypothesis of no tem- 
poral structure (TC=0) and can be used to 
compare trees of different size. The tree with 
perfect temporal structure in Figure 1A has by 
definition TC=1, while the tree in Figure IB 
has TC=0.72. 56 In future studies, comparison 
of HIV-1 intra-host genealogies from patients 
sampled under different conditions could pro- 
vide important insight on the dynamic of viral 
reservoirs. In addition, the ability to quantify 
and compare topological deviations from per- 
fect temporal (ladder-like) structure among 
different trees may shed light on qualitative 
and quantitative differences in the evolution- 
ary and ecological processes shaping each phy- 
togeny. 



HIV-1 intra-host effective 
population size and metapopu- 
lation structure 

Strong evidence for the importance of selec- 
tion on HIV-1 intra-host evolution is also pro- 
vided by studies on the total viral population 
size within an infected host, estimated on the 
order of 10 10 individual viruses. 5758 According 
to the general prediction of evolutionary theo- 
ry, mutations in small size populations are pro- 
duced rarely and their fixation will essentially 
depend on random genetic drift. On the other 
hand, in populations large enough that can be 



considered for any practical purpose of infinite 
size, mutations are produced more frequently 
and their fixation will eventually be decided by 
the deterministic action of natural selection. It 
has been suggested that it would be appropri- 
ate to consider the HIV-1 population within an 
infected host as if its size were infinite and, 
therefore, evolutionary driven by selection. 22 
On the other hand, several studies have calcu- 
lated that HIV-1 intra-host effective population 
size (Ne), corresponding to the size of a hypo- 
thetical neutral idealized population (i.e. 
described by the standard Wright-Fisher 
model), is several orders of magnitude smaller 
(10 3 -10 4 ) than the total population size. 10 ' 39 ' 50 ' 59 
Such estimates would be more consistent with 
an evolutionary process driven by random 
genetic drift, despite the strong evidence of 
selection provided by dN/dS analyses (see pre- 
vious section). At least two alternative expla- 
nations have been proposed to reconcile the 
discrepancy. First, Ne estimates in these stud- 
ies were derived from methods implicitly 
assuming neutrality, making it impossible to 
assess whether low values reflect true stochas- 
tic processes or continual selective sweeps. 34 
Second, HIV-1 populations do not evolve under 
panmixia (one single population), but rather 
with some metapopulation structure (i.e. divi- 
sion in sub-populations) that causes a reduc- 
tion in Ne. 50 Indeed, several studies have 
shown the existence of diverse HIV-1 subpopu- 
lations infecting different body tissues, for 
example semen, 60,61 breast milk, 62,63 genital 
tract, 64 and brain. 65 68 The term compartments 
has been introduced to indicate tissues or cell 
types were viral replication is ongoing but viral 
gene-flow (in and out) is restricted because of 
anatomical barriers. 24 54 Compartmentalization 
is shown, phylogenetically, by the existence of 
supported monophyletic clades including 
sequences isolated from a specific tissue, as 
displayed in Figure 2. The ML tree was inferred 
from HIV-1 sequences isolated from necropsy 
tissues of an infected subject who died with 
widespread atherosclerosis. 69 Additional stud- 
ies have also demonstrated that different brain 
regions harbor distinct viral populations char- 
acterized by specific evolutionary dynamics 70,71 
that may be associated with pathogenic 
processes like the onset of HIV-1 associated 
dementia. 72,73 



Viral evolutionary rate 
and disease progression 

Estimates of HIV-1 intra-host evolutionary 
rate range from 10 3 to 10 4 nucleotide substitu- 
tions per site per year depending on specific 
patients and gene region analyzed. 16,74 Several 
studies have shown an inverse relationship 



between rate of viral evolution and disease 
progression, 75 80 but contrary evidence has also 
been presented. 81,82 A sophisticated molecular 
clock analysis of gpl20 C2V5 sequences from 
nine subjects followed from seroconversion up 
to 11 years found that HIV-1 disease progres- 
sion seems to be predicted by synonymous 
substitution rates, which are expected to be 
selectively neutral and, therefore, proportional 
to the underlying viral replication rate. 83 
Specifically, lower substitution rates were 
found in patients with slow disease progres- 
sion. The authors speculated that such tower 
rates might reflect lower levels of persistent 
immune activation, which in turn impose sig- 
nificant constraints on viral generation times 
and viral evolution. T cell activation is the 
strongest predictor of progression to AIDS, 84,85 
and can determine the rate and continuity of 
viral replication. 86 

In the attempt to unify all these observa- 
tions, Lee et a/. 35 in 2008 developed an HIV-1 
intra-host evolution model that simulated the 
effects of mutation and fitness of sequence 
variants. The model indicated that the 
decrease of diversity and stabilization of diver- 
gence in the last phase of the infection are the 
result of a decrease in the proportion of off- 
spring that are mutants as the distance from 
the founder strain increases, rather than of an 
increase in viral fitness. In other words, HIV-1 
intra-host evolutionary rate is expected to 
decrease towards the end of the infection in 
correlation with the rate of CD4 + T cell decline 
because of the progressive disappearance of 
target cells that can sustain viral replication. 
Phylogenetic and molecular clock analysis of 
empirical data (15 patients followed for 3-12 
years) supported the prediction of the model. 35 

Interestingly, a small proportion of HIV-1 
infected patients (<1%), known as elite con- 
trollers or long-term nonprogressors (LTNPs), 
exhibit viral load (VL) below the detection 
limit of conventional assays (<50 copies/mL 
plasma) and can maintain CD4+ T cell counts 
of >500 cells/mm 3 for more than 10 years with- 
out any therapy. 87 Most LTNPs carry the HLA- 
B*5701 allele and their ability to control the 
infection have intensively been studied for its 
obvious clinical implications. 49,87,88 From the 
standpoint of virus evolution, little work has 
been done since intra-host genetic variation is 
often minimal and difficult to evaluate given 
the undetectable VL. However, some subjects 
expressing HLA-B*5701 do show low levels of 
viremia (usually ranging between 1000 and 
10,000 copies/mL) and can still remain clini- 
cally and/or immunologically stable for years 
without therapy. 89,90 CD8+ T cell responses in 
these patients target several epitopes in the 
gag p24 viral gene, 91 93 and an interesting asso- 
ciation between HIV-1 intra-host evolutionary 
rate and immune response has recently been 
found. 48 Heterochronous p24 sequence data 
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were employed to estimate with a Bayesian 
molecular clock method the posterior distribu- 
tion of viral evolutionary rates within subjects 
classified as either high-risk progressor (HRP) 
or low-risk progressor (LRP) on the basis of 
their CD4 + T cell count in early (10-13 weeks) 
infection. 48 Lower evolutionary rates were 
found in LRPs who also exhibited higher poly- 
functional responses than HRPs in terms of 
higher proportion of CD8 + T cells producing a 
combination of interferon-y, inteleukin-2, 
MIP-1 and perforin when induced by wild-type 
HLA-B*5701-restricted epitopes. Further 
investigation of the interplay between specific 
CD8 + T cell responses and intra-host viral rate 
of evolution in these subjects may help defin- 
ing correlates of protection, one of the greatest 
challenges in HIV-1 vaccine design. 



The phylogenetic challenge 
of HIV-1 recombination 

Due to super-infection and reverse tran- 
scriptase switches between alternative genom- 
ic templates, that are an integral part of HIV 
lifecycle, HIV-1 recombination rate per replica- 
tion is likely exceeding the mutation rate. 94,95 
The impact of recombination on viral genetic 
diversity is evident at the molecular epidemio- 
logical scale. The majority of HIV-1 strains cir- 
culating worldwide cluster within a large group 
called M (for Main) that includes, besides ten 
phylogenetically distinct subtypes and two sub- 
subtypes, several inter-subtype recombinants, 
known as circulating recombinant forms 
(CRFs). 9697 Recombination can affect HIV-1 
evolutionary dynamics by providing a mecha- 
nism even more efficient than fast mutation 
rate for the virus to escape from the accumula- 
tion of deleterious mutations or to jump 
between adaptive peaks. 16 It might accelerate 
progression to AIDS and can foster the emer- 
gence of viral variants able to evade immune 
pressure or with increased resistance to 
drugs. 98 100 Analysis of HIV-1 sequences ampli- 
fied from different tissues, sampled post 
mortem from patients with a number of AIDS- 
related malignancies, have shown that recom- 
bination is common and more extensive in tis- 
sues with abnormal histopathology compared 
to normal tissues. 101 

As shown in Figure 3, intra-host recombina- 
tion could be detected, in principle, by moving 
a sliding window along a multiple sequence 
alignment and calculating the bootstrap sup- 
port for the phylogenetic clustering of a query 
sequence, the potential recombinant, with dif- 
ferent reference sequences in different 
genomic regions. This analysis is an example 
of bootscanning; a method specifically devel- 
oped to investigate recombination in HIV. 102 



Recombination violates the basic assumption 
of phylogeny inference of descent from a com- 
mon ancestor. Recombinant sequences cannot 
be clustered correctly within one single phylo- 
genetic tree but can be displayed within net- 
work-like graphs able to represent simultane- 
ously different tree topologies (Figure 3). 



Using algorithms that do not explicitly 
model recombination can bias molecular clock 
as well as population genetic estimates. 103104 
HIV-1 evolutionary dynamics should therefore 
be studied with methods that take recombina- 
tion into account. Unfortunately, this is not the 
case for most of the work discussed in the pre- 
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Figure 2. Maximum likelihood genealogy of HrV-1 gpl20 V3C4 sequences amplified 
from necropsy tissues of a patient with widespread atherosclerosis. The tree includes a 
subset of the sequences obtained from subject AZ described in Lamers etal. 6i (2011), and 
was inferred using the HKY+G model. Branch lengths are scaled in nucleotide substitu- 
tions per site according to the bar at the bottom and colored to highlight the tissue of ori- 
gin according to the legend in the figure. The tissue of origin of the internal branches was 
inferred by maximum parsimonious reconstruction of ancestral states. Numbers along the 
branches represent percent bootstrap values (1000 replicates). The alignment used in the 
analysis is available from the author upon request. 
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Figure 3. Putative recombination analysis of HIV-1 intra-host sequences sampled from 
different tissues as indicated by the color legend in the figure. A query sequence is com- 
pared to a set of reference sequences by a sliding window analysis, which calculates the 
bootstrap support for the clustering of the query sequence with a specific reference 
sequence along the genome. In this cartoon example, a query sequence from plasma clus- 
ters with high bootstrap support with a reference sequence from peripheral blood 
mononuclear cells (PBMCs) in the 5' end of the genome, while with a sequence from thy- 
mus in the 3' end. The two trees are schematically illustrated within the frame and the net- 
work shows how they could be represented by a cyclic graph where the recombinant 
sequence is simultaneously connected to the putative parental strains. 
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vious sections, where authors have chosen 
either to ignore recombination or the common 
approach to identify recombinants and exclude 
them from the analysis. The latter strategy has 
two important caveats. First, because recombi- 
nation is so frequent and plays such a central 
role, inferences on the evolutionary dynamics 
of the virus after exclusion of recombinants 
may be meaningless or, at the very least, 
incomplete. Second, despite the plethora of 
methods developed to detect recombination, 
each algorithm has limitations resulting, in 
general, in an under estimation of recombina- 
tion, especially in data sets of closely related 
sequences. 103 104 For example, while inter-sub- 
type recombination is relatively straightfor- 
ward to detect, 105 bootscanning performs very 
poorly with HIV-1 sequences from individuals 
infected by the same subtype. Network-based 
methods have been more successful and have 
shown that the number of recombinant 
sequences detected by classic phylogenetic 
analysis can lead to a significant under esti- 
mation of the actual number of recombinants 
within an intra-host data set. 106 107 However, it 
is clear that novel techniques and additional 
studies are still needed to assess the full 
impact of recombination on HIV-1 intra-host 
evolutionary and population dynamics. 



Final remarks 

This review briefly highlights some of the 
major findings that have contributed to our 
current understanding of HIV-1 intra-host phy- 
lodynamics. There is general agreement on 
deterministic selection processes as the main 
driving force of viral evolution within the 
infected host. The study of the temporal struc- 
ture of genealogies inferred from heterochro- 
nous sequences is one of the key techniques 
that can be used to assess the impact of contin- 
ual immune selection on viral population 
turnover and a promising tool to investigate 
further the activation or re-activation of viral 
reservoirs. In addition, the recent application 
of high-resolution deep sequencing to study 
HIV-1 population complexity and structure 
(biodiversity) within an infected host can 
allow the characterization of low frequency 
variants responsible for significant changes in 
the viral evolutionary landscape that would be 
impossible to detect by conventional meth- 
ods. 108 On the other hand, a thorough under- 
standing of early evolutionary events at the 
moment of transmission and during primary 
infection, as well as a full account of the role 
played by recombination is still lacking and 
among the future challenges of HIV molecular 
evolutionary research. 
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